Import modules

Load Data

Merge Data

EDA (Exploratory Data Analysis)

High Level Statistics

Observation :

Observation :

Observation :

Observation :

Observation :

Data Imbalance

Observation :

Check for missing values
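A minimal sketch of a per-column missing-value check with pandas. The toy frame below is hypothetical: its column names echo features listed in this outline, but the values are made up for illustration and are not the notebook's data.

```python
import numpy as np
import pandas as pd

# Hypothetical toy data; values are invented for illustration only.
df = pd.DataFrame({
    "stake": [10.0, 25.0, np.nan, 5.0],
    "IP": ["1.2.3.4", None, "5.6.7.8", None],
    "type": ["back", "lay", "back", "back"],
})

# Count and percentage of missing values per column, worst column first.
missing = pd.DataFrame({
    "n_missing": df.isna().sum(),
    "pct_missing": df.isna().mean() * 100,
}).sort_values("n_missing", ascending=False)

print(missing)
```

Columns with only a handful of missing rows can often be imputed or dropped row-wise; which option suits this dataset is a judgment call that the outline does not settle.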

Observation :

Feature Analysis

_id

Observation :

Categorical Analysis

type

Observation :

horse

marketId

Observation :

IP (3 missing values)

Feature Engineering

eventType

userName

selectionName

marketName

event

winnerId

Numerical Features

stake

Observation :

Let's do more analysis.

Log Transform

Observation :

Box-Cox Transform
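A minimal sketch comparing the log and Box-Cox transforms on a right-skewed, strictly positive variable. The `stake` array here is a synthetic lognormal sample used as a stand-in, not the real column, and comparing skewness is just one plausible way to judge the result.

```python
import numpy as np
from scipy import stats

# Synthetic stand-in for a skewed, strictly positive feature such as stake.
rng = np.random.default_rng(0)
stake = rng.lognormal(mean=3.0, sigma=1.0, size=1000)

# Log transform; log1p(x) = log(1 + x) also tolerates zeros.
log_stake = np.log1p(stake)

# Box-Cox: y = (x**lam - 1) / lam for lam != 0, and log(x) for lam == 0.
# SciPy estimates lam by maximum likelihood; the input must be strictly positive.
bc_stake, lam = stats.boxcox(stake)

print("skew raw    :", stats.skew(stake))
print("skew log1p  :", stats.skew(log_stake))
print("skew Box-Cox:", stats.skew(bc_stake), "lambda:", lam)
```

If the real column contains zeros or negative values, `stats.boxcox` raises an error; shifting the data or using a Yeo-Johnson transform would be possible alternatives.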

betRate

Log Transform

Box-Cox Transform

averagePriceMatched

Log Transform

Box-Cox Transform

placedDate

Date

Time

hour

week of the year

weekday

day_name (7 week days)
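A minimal sketch of deriving the date parts listed above from `placedDate` with the pandas `.dt` accessor. The two timestamps and the derived column names (`date`, `time`, `week_of_year`, and so on) are illustrative assumptions, not taken from the notebook.

```python
import pandas as pd

# Hypothetical sample timestamps; the real placedDate format may differ.
df = pd.DataFrame({"placedDate": ["2021-03-01 14:35:10", "2021-03-06 09:05:00"]})
df["placedDate"] = pd.to_datetime(df["placedDate"])

dt = df["placedDate"].dt
df["date"] = dt.date                                    # calendar date
df["time"] = dt.time                                    # time of day
df["hour"] = dt.hour                                    # 0-23
df["week_of_year"] = dt.isocalendar().week.astype(int)  # ISO week number
df["weekday"] = dt.weekday                              # Monday=0 ... Sunday=6
df["day_name"] = dt.day_name()                          # e.g. "Monday"

print(df)
```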

-----

TimeSeries Plot Generated By Tableau

Bivariate Analysis

Correlation Analysis

Compute pairwise correlation of columns, excluding NA/null values.
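A minimal sketch of the pairwise correlation described above, using `DataFrame.corr`. The column names follow the numerical features in this outline; the values are invented. Each pair of columns is correlated over the rows where both values are present, so different cells of the matrix may rest on different subsets of rows.

```python
import numpy as np
import pandas as pd

# Hypothetical numeric data with a few gaps.
df = pd.DataFrame({
    "stake": [10.0, 20.0, np.nan, 40.0, 50.0],
    "betRate": [1.5, 2.0, 2.5, np.nan, 4.0],
    "averagePriceMatched": [1.4, 2.1, 2.4, 3.0, 4.2],
})

# Pearson correlation; NaN entries are excluded pair by pair.
corr = df.corr(method="pearson")

print(corr.round(3))
```

Passing `method="spearman"` instead gives a rank-based correlation, which may be more robust for heavily skewed features.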

Data Standardization

Train-test split
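A minimal sketch of a stratified split followed by standardization with scikit-learn. The data is synthetic, and the binary target, 75/25 ratio, and `random_state` are assumptions. The scaler is fitted on the training portion only, which is one common ordering for avoiding leakage from the test set and may differ from the order the notebook uses.

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler

# Synthetic features and an imbalanced binary target (80% / 20%).
rng = np.random.default_rng(42)
X = rng.normal(loc=5.0, scale=2.0, size=(200, 3))
y = np.array([0] * 160 + [1] * 40)

# Stratify so both splits keep the same class ratio.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=42
)

# Fit the scaler on training data only, then apply it to both splits.
scaler = StandardScaler().fit(X_train)
X_train_std = scaler.transform(X_train)
X_test_std = scaler.transform(X_test)

print(X_train_std.mean(axis=0).round(6), X_train_std.std(axis=0).round(6))
```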

Feature Selection

For Numerical Data

For Categorical Data

Build Data Matrix for Models

t-SNE Visualization

Models

Random Model

Logistic Regression (SGD)

Logistic Regression (Sklearn)

Decision Tree Model

Random Forest Model

XGBoost Model

Neural Network (ANN) Model